Provenance in Manually Curated Databases
Abstract
Many curated databases are constructed by scientists integrating various existing data sources. Most current approaches to provenance in databases are based on views and fail to take account of the added value of the work done by scientists in manually creating and modifying data. Capturing provenance in such an environment is a challenging problem, requiring changes in practice, changes to existing software, and crucially, a good model of the process of curation.
Similar resources
A Provenance Model for Manually Curated Data
Many curated databases are constructed by scientists integrating various existing data sources “by hand”, that is, by manually entering or copying data from other sources. Capturing provenance in such an environment is a challenging problem, requiring a good model of the process of curation. Existing models of provenance focus on queries/views in databases or computations on the Grid, not updat...
Improv: Flexible Data Provenance for Relational Databases
Curated databases, which consist of data extracted from original sources, printed articles, and other databases, are a valuable source of data for scientists. However, as curated databases aggregate information from multiple sources, the origin of the data elements can be lost. Because of this, curated databases often provide support for data annotations, which are pieces of extra information a...
A Copy-and-Paste Model for Provenance in Curated Databases
Provenance is information describing the origin, construction, location, ownership, or other aspects of the history of an object. Previous work on provenance has concentrated on an understanding of how provenance is described when the data of interest has been derived by queries from other data sources, as is the case in data warehouses. In this paper we focus on another important class of data...
On InChI and evaluating the quality of cross-reference links
BACKGROUND: There are many databases of small molecules focused on different aspects of research and its applications. Some tasks may require integration of information from various databases. However, determining which entries from different databases represent the same compound is not straightforward. Integration can be based, for example, on automatically generated cross-reference links betwe...
Why and Where: A Characterization of Data Provenance
With the proliferation of database views and curated databases, the issue of data provenance (where a piece of data came from and the process by which it arrived in the database) is becoming increasingly important, especially in scientific databases where understanding provenance is crucial to the accuracy and currency of data. In this paper we describe an approach to computing provenance when...
Publication date: 2006